Search CORE

58 research outputs found

Statistical mechanics of RNA folding: importance of alphabet size

Author: A.D. Ellington
B.M.R. Stadler
Chao Tang
E. Shakhnovich
E.L. Kussell
Eldon Emberly
H. Li
H. Li
H. Li
H.S. Chan
I. Tinoco
I.L. Hofacker
J. Miller
J.S. McCaskill
L.F. Landweber
M. Zuker
Ned S. Wingreen
R. Bundschuh
R. Bundschuh
R. Mélin
Ranjan Mukhopadhyay
S. Govindarajan
S.Y. Le
W. Fontana
W. Fontana
Publication venue: 'American Physical Society (APS)'
Publication date: 04/08/2003
Field of study

We construct a minimalist model of RNA secondary-structure formation and use it to study the mapping from sequence to structure. There are strong, qualitative differences between two-letter and four or six-letter alphabets. With only two kinds of bases, there are many alternate folding configurations, yielding thermodynamically stable ground-states only for a small set of structures of high designability, i.e., total number of associated sequences. In contrast, sequences made from four bases, as found in nature, or six bases have far fewer competing folding configurations, resulting in a much greater average stability of the ground state.Comment: 7 figures; uses revtex

arXiv.org e-Print Archive

Crossref

Louse (Insecta : Phthiraptera) mitochondrial 12S rRNA secondary structure is highly variable

Author: Billoud B.
Collins L.J.
Corpet F.
Critchlow D.E.
Day W.H.E.
Fontana W.
Gutell R.R.
Hafner M.S.
Hafner M.S.
Hickson R.E.
Hickson R.E.
Hofacker I.L.
Houde P.
Johnson K.P.
Johnson K.P.
K. P. Johnson
Konings D.A.M.
Lenhof H.-P.
Lockhart P.J.
Mindell D.P.
Moran N.A.
Page R.D.M.
Page R.D.M.
Page R.D.M.
R. Cruickshank
R. D. M. Page
Shao R.
Simon C.
Springer M.S.
Stoye J.
Swofford D.L.
Wheeler W.C.
Publication venue: 'Wiley'
Publication date: 01/01/2002
Field of study

Lice are ectoparasitic insects hosted by birds and mammals. Mitochondrial 12S rRNA sequences obtained from lice show considerable length variation and are very difficult to align. We show that the louse 12S rRNA domain III secondary structure displays considerable variation compared to other insects, in both the shape and number of stems and loops. Phylogenetic trees constructed from tree edit distances between louse 12S rRNA structures do not closely resemble trees constructed from sequence data, suggesting that at least some of this structural variation has arisen independently in different louse lineages. Taken together with previous work on mitochondrial gene order and elevated rates of substitution in louse mitochondrial sequences, the structural variation in louse 12S rRNA confirms the highly distinctive nature of molecular evolution in these insects

CiteSeerX

Crossref

Enlighten

Control of Cognate Sense mRNA Translation by cis-Natural Antisense RNAs.

Author: Deforges J.
Gadekar V.P.
Hart-Smith G.
Hofacker I.L.
Iseli C.
Jacquet P.
Poirier Y.
Reis R.S.
Sheppard S.
Tanzer A.
Xenarios I.
Publication venue: 'American Society of Plant Biologists (ASPB)'
Publication date: 01/05/2019
Field of study

Cis-Natural Antisense Transcripts (cis-NATs), which overlap protein coding genes and are transcribed from the opposite DNA strand, constitute an important group of noncoding RNAs. Whereas several examples of cis-NATs regulating the expression of their cognate sense gene are known, most cis-NATs function by altering the steady-state level or structure of mRNA via changes in transcription, mRNA stability, or splicing, and very few cases involve the regulation of sense mRNA translation. This study was designed to systematically search for cis-NATs influencing cognate sense mRNA translation in Arabidopsis (Arabidopsis thaliana). Establishment of a pipeline relying on sequencing of total polyA <sup>+</sup> and polysomal RNA from Arabidopsis grown under various conditions (i.e. nutrient deprivation and phytohormone treatments) allowed the identification of 14 cis-NATs whose expression correlated either positively or negatively with cognate sense mRNA translation. With use of a combination of cis-NAT stable over-expression in transgenic plants and transient expression in protoplasts, the impact of cis-NAT expression on mRNA translation was confirmed for 4 out of 5 tested cis-NAT:sense mRNA pairs. These results expand the number of cis-NATs known to regulate cognate sense mRNA translation and provide a foundation for future studies of their mode of action. Moreover, this study highlights the role of this class of noncoding RNAs in translation regulation

Serveur académique lausannois

Simultaneous alignment and folding of protein sequences

Author: A. Caprara
B.E. Shakhnovich
C.B. Do
C.B. Do
D. Frishman
D. Sankoff
D.H. Mathews
G. Raghava
I.L. Hofacker
J. Selbig
J. Waldispuhl
J. Waldispuhl
J.H. Havgaard
L.R. Forrest
M. Brudno
M. Cline
M. Lomize
M. Menke
P. Bradley
P. Fariselli
P. Rice
R. Backofen
R. Doolittle
R.A. Sutormin
R.C. Edgar
R.C. Edgar
R.C. Edgar
R.L.J. Dunbrack
S. Henikoff
S. Will
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Accurate comparative analysis tools for low-homology proteins remains a difficult challenge in computational biology, especially sequence alignment and consensus folding problems. We presentpartiFold-Align, the first algorithm for simultaneous alignment and consensus folding of unaligned protein sequences; the algorithm’s complexity is polynomial in time and space. Algorithmically,partiFold-Align exploits sparsity in the set of super-secondary structure pairings and alignment candidates to achieve an effectively cubic running time for simultaneous pairwise alignment and folding. We demonstrate the efficacy of these techniques on transmembrane β-barrel proteins, an important yet difficult class of proteins with few known three-dimensional structures. Testing against structurally derived sequence alignments,partiFold-Align significantly outperforms state-of-the-art pairwise sequence alignment tools in the most difficult low sequence homology case and improves secondary structure prediction where current approaches fail. Importantly, partiFold-Align requires no prior training. These general techniques are widely applicable to many more protein families. partiFold-Align is available at http://partiFold.csail.mit.edu

CiteSeerX

DSpace@MIT

Crossref

RFMirTarget: A Random Forest Classifier for Human miRNA Target Gene Prediction

Author: A. Liaw
C. Xue
D.P. Bartel
I.L. Hofacker
J. Liu
J.G. Betancur
J.R. Lytle
L. Breiman
L. Breiman
M. Yousef
P. Jiang
P. Maziére
R. Batuwita
R.C. Lee
S. Bandyopadhyay
S.K. Kim
T.M. Witkos
X. Chen
Y. Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Abstract. MicroRNAs (miRNAs) are key regulators of eukaryotic gene expression whose fundamental role has been already identified in many cell pathways. The correct identification of miRNAs targets is a major challenge in bioinformatics. So far, machine learning-based methods for miRNA-target prediction have shown the best results in terms of specificity and sensitivity. However, despite its well-known efficiency in other classifying tasks, the random forest algorithm has not been employed in this problem. Therefore, in this work we present RFMirTarget, an efficient random forest miRNA-target prediction system. Our tool analyzes the alignment between a candidate miRNA-target pair and extracts a set of structural, thermodynamics, alignment and position-based features. Experiments have shown that RFMirTarget achieves a Matthew’s correlation coefficient nearly 48 % greater than the performance reported for the MultiMiTar, which was trained upon the same data set. In addition, tests performed with RFMirTarget reinforce the importance of the seed region for target prediction accuracy

CiteSeerX

Crossref

RNA Folding Algorithms with G-Quadruplexes

Author: A. Arora
A. Bugaut
A. Guédin
A. Guédin
A. Joachimi
A. Verma
A.K. Todd
A.Y. Zhang
B. Luke
C.B. Do
C.T. Lauhon
D. Gomez
D.H. Mathews
D.H. Zhang
D.H. Zhang
G.G. Jayaraj
H.M. Wong
I.L. Hofacker
I.L. Hofacker
J. Eddy
J. Gros
J.D. Beaudoin
J.E. Johnson
J.L. Huppert
J.L. Huppert
J.S. McCaskill
K. Ito
K. Paeschke
L. Menon
M. Bensaid
M. Webba da Silva
M. Wieland
M. Zuker
M.J. Morris
O. Kikin
O. Stegle
P. Flajolet
P. Schuster
R. Lorenz
R.E. Bruccoleri
S. Kumari
S.H. Bernhart
U. Mückstein
Y. Zhao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Crossref

A Combinatorial Framework for Designing (Pseudoknotted) RNA Algorithms

We extend an hypergraph representation, introduced by Finkelstein and Roytberg, to unify dynamic programming algorithms in the context of RNA folding with pseudoknots. Classic applications of RNA dynamic programming energy minimization, partition function, base-pair probabilities...) are reformulated within this framework, giving rise to very simple algorithms. This reformulation allows one to conceptually detach the conformation space/energy model -- captured by the hypergraph model -- from the specific application, assuming unambiguity of the decomposition. To ensure the latter property, we propose a new combinatorial methodology based on generating functions. We extend the set of generic applications by proposing an exact algorithm for extracting generalized moments in weighted distribution, generalizing a prior contribution by Miklos and al. Finally, we illustrate our full-fledged programme on three exemplary conformation spaces (secondary structures, Akutsu's simple type pseudoknots and kissing hairpins). This readily gives sets of algorithms that are either novel or have complexity comparable to classic implementations for minimization and Boltzmann ensemble applications of dynamic programming

arXiv.org e-Print Archive

HAL-CentraleSupelec

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

Hal-Diderot

HAL-Polytechnique

HAL-Rennes 1

Detecting the Dependent Evolution of Biosequences

Author: A. Coventry
A. Siepel
A.K. Ramani
C.S. Goh
D. Barker
D. Bernardo di
D.D. Pollock
D.P. Wall
E. Rivas
H.B. Fraser
H.F. Noller
I.K. Jordan
I.L. Hofacker
J. Felsenstein
J. Felsenstein
J.S. Pedersen
M. Hasegawa
M. Lynch
M. Pagel
S. Ohno
S. Washietl
S. Washietl
S.R. Eddy
Z. Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

A probabilistic graphical model is developed in order to detect the dependent evolution between different sites in biological sequences. Given a multiple sequence alignment for each molecule of interest and a phylogenetic tree, the model can predict potential interactions within or between nucleic acids and proteins. Initial validation of the model is carried out using tRNA sequence data. The model is able to accurately identify the secondary structure of tRNA as well as several known tertiary interactions

Crossref

eScholarship - University of California